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Summary; 

The  study  presented  a  univariate  stochastic  modeling  algorithm 
(USA)  for  purposes  of  the  Box-Jenkins  modeling  of  one  variable  time 
series  via  a  fully  automatic  process .  The  algorithm  was  programed 
for  the  computer  and  tested  empirically.  It  was  found  that  USA  fore«- 
casts  were  not  statistically  different  than  those  generated  by  conven- 
tional modeling  procedures. 

The  results  indicate  that  there  is  evidence  that  the  Box-Jenkins 
modeling  process  can  be  fully  automated.  It  is  felt  that  the  use  of 
USA  can  (1)  increase  the  reproducibility  of  research,  (2)  save  time, 
(3)  be  used  as  a  "black  box"  by  the  statistically  untrained,  and  (4) 
make  explicit  the  assumptions  employed  in  the  modeling  process. 


In  recent  years  there  has  been  an  increased  emphasis  on  the  use  o£ 
the  Box  and  Jenkins  (1970)  method  of  modeling  univariate  stochastic,  time 

series.   Notwithstanding  this  emphasis,  the  use  of  the  method  involves 

2 
the  making  of  sequence  of  highly  subjective  decisions.   The  purpose  of 

the  present  study  is  to  present  an  algorithm  which  quantifies  this  sub- 
jective sequence.  Like  any  formal  modeling  process  the  algorithm  does 
not  replace  the  need  to  make  decisions,  but  simply  quantifies  them  and 
makes  them  automatically. 

The  paper  is  presented  in  four  sections.  Section  one  discusses  the 
potential  benefits  of  using  an  automatic  univariate  stochastic  algorithm 
(USA).  Section  two  presents  the  USA  followed  by  an  empirical  validation 
in  section  three.  Section  four  presents  a  summary  and  conclusions,, 

1.0  Benefits  Associated  with  the  USA 
1.1  Subjectivity  and  Reproducibility 

As  discussed  by  Dharan  (1977)  one  of  the  major  requirements  or  char- 
acteristics of  scientific  research  is  reproducibility.  However,  "while 
empirical  research  in  areas  other  than  time  series  analysis  may  contain 
some  subjectivity,  none  probably  relies  on  subjectivity  as  much  as  the 
Box-Jenkins  procedure  does"  (Dharan,  1977,  p.  13).  The  result  is  that 
different  researchers  are  likely  to  fit  different  models  to  the  same 
data.  However,  the  use  of  a  programmed  algorithm  will  always  produce 
the  same  model,  given  a  particular  time  series,  regardless  of  the  re- 
search. 
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1.2  USA  and  Required  Earnings  Forecasts 

There  is  a  reasonable  indication  that  earnings  forecasts  will  be  re- 
quired in  external  reports  in  the  not  too  distant  future  (see  Wall  Street 
Journal,  1977) .  This  will  probably  lead  to  an  emphasis  on  the.  disclosure 
of  the  procedures  and  assumptions  used  to  generate  the  forecasts*  The 
use  of  a  USA  type  method  makes  possible  the  disclosure  of  a  precise*  des- 
cription of  the  procedures  used. 

1.3  USA  and  the  Untrained  Individual 

One  benefit  of  USA  is  that  it  can  be  programmed  on  the  computer  for 
use  as  a  "black  box".  This  means  that  it  can  be  used  by  the  nonstatisti- 
tion  auditor  in  the  forecasting  of  account  balances  or  by  management  in 
the  decision  making  process. 

1.4  USA  and  Time  Savings 

Programmed  on  the  computer  USA  can  model  literally  thousands  of 
time  series  models  in  a  very  short  period  of  time.  For  example,  the 
author  found  that  its  use  on  the  CYBER  175  required  an  average  of  .333 
CPU  seconds  for  modeling  and  forecasting  the  EPS  of  firms  containing 
50  observations. 

2.0  A  Proposed  Algorithm 
Discussion  of  USA  can  be  decomposed  according  to  the  main  phases 
of  the  modeling  process:   (1)  determining  the  appropriate  degree  of 
differencing,  (2)  model  identification,  (3)  nonlinear  least  squares 
estimation,  and  (4)  diagnostic  checks  for  model  adequacy.  The  focus 
will  be  on  stages  (1)  and  (2)  since  they  are  the  most  difficult  phases 
to  algorithmize. 
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2.1  Determining  the  Appropriate  Degree  of  Differencing 

The  purpose  of  differencing  is  to  produce  a  transformed  series 
which  is  approximately  constant  in  mean*  The  criterion  given  by  Box 
and  Jenkins  (1970)  is  that  a  consecutive  difference  should  be  taken  if 
the  autocorrelation  function  does  not  dampen  out  "rapidly".  The  deter- 
mination of  what  is  rapid  is  a  subjective  and  pivotal  point  in  the  model- 
ing process.  The  same  criterion  holds  for  a  seasonal  difference j  the 
autocorrelation  function  examined  at  only  seasonal  lags  must  dampen 
out  quickly. 

Rule  for  Taking  a  Consecutive  Difference.  Based  on  the  above  cri- 
terion it  was  decided  to  make  the  following  formal  rule:  If  the  number 
of  initial  consecutive  significant  autocorrelations  is  greater  than  an 
arbitrary  but  fixed  constant  K,  then  the  series  Zfc  is  formed  from  taking 
a  first  difference  on  the  original  series  X  (see  note  3).  If  no  first 

f 

difference  is  taken  then  Z_   ■  X.. 

t    t 

Rule  for  Taking  a  Seasonal  Difference.  If  the  number  of  initial 
consecutive  autocorrelations  of  Z  (examined  only  at  seasonal  lags)  is 
greater  than  L  (an  arbitrary  but  fixed  constant)  then  a  series  Y  is 
formed  by  taking  a  seasonal  difference  on  Z  (see  note  4).  If  no  seasonal 
difference  is  taken  then  Y  ■  Z  .  It  is  important  that  the  two  rules  for 
consecutive  and  seasonal  differences  are  applied  in  the  order  given.  This  is 
because,  if  a  first  difference  is  not  taken  (when  necessary)  it  is  possible 
that  there  will  be  significant  the  seasonal  lags  due  to  the  need  for  a 
first  difference.  The  taking  of  a  first  difference  will  remove  these 
confounding  significant  autocorrelations. 


2.2  Model  Identification 

The  basic  identification  is  executed  by  the  examination  of  the  sample 
autocorrelation  and  partial  correlation  functions  (ACF  and  PCF,  respec- 
tively) of  the  series  Y  .  The  sample  functions  are  compared  to  theorem 
ical  functions  known  to  be  associated  with  various  ARTMA  models.  The 
next  step  is  to  select  a  model  which  has  the  theoretical  function  most 
closely  associated  with  the  sample  pattern . 

In  the  development  of  USA,  ACFs  and  PCFs  are  examined  for  the  possible 
existence  of  three  basic  types  of  patterns: 

(1)  Sign  Alteration;  If  an  ACF  or  PCF  alternates  in  sign  for 
a  number  of  initial  consecutive  correlations  that  exceeds 
an  arbitrary  but  fixed  constant  m,  then  that  ACF  or  PCF 
is  assumed  to  alternate  in  sign. 

(2)  ■  Unidirectional  Tapering:  If  an  ACF  or  PCF  has  a  number  of 

initial  consecutive  correlations  that  exceeds  an  arbitrary 

but  fixed  constant  n,  then  that  ACF  or  PCF  is  assumed  to 

6 
taper. 

(3)  Seasonal  Pattern:  If  an  ACF  or  PCF  has  a  number  of  initial 
consecutive  seasonal  correlations  that  exceeds  an  arbitrary 
but  fixed ^constant  p,  then  that  ACF  or  PCF  is  assumed  to 
have  a  seasonal  pattern. 

2.2.1  Introduction  of  Notation 

For  purposes  of  notation,  two  zero-one  variables  are  defined  to  de- 
note the  existence  of  patterns  in  the  ACF  or  PCF.  These  are  APTRNID 
and  FPTRNID  and  are  set  to  one  if  at  least  one  of  the  above  three  patterns 
occur  in  the  ACF  or  PCF  respectively  (otherwise  set  to  zero). 
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In  addition,  let  A,  denote  the  autocorrelation  coefficient  at  lag  k 
and  P.  denotes  the  partial  correlation  coefficient  at  lag  k«,  Also,  A^ 
and  F,  will  be  defined  to  be  equal  to  one  whes  significant  (and  zero 
otherwise).  Finally,  let  the  variable  ACHECH)  (or  PCNECID)  equal  the 
number  of  initial  consecutive  significant  autocorrelations  (partial 
correlations) . 

2.2,2  Operation  of  the  Algorithm 

The  USA  model  identification  procedure  selects  a  model  based  on  the 
values  that  the  above  defined  variables  take  on.  Table  1  presents  a  list 
of  models  with  the  associated  values  of  the  above  variables  required 
for  identification.  The  procedure  used  is  to  sequentially  proceed  down 
the  list  until  a  model  is  selected.  The  next  step  is  to  estimate  para- 
meters for  this  model  and  then  apply  diagnostic  checks  to  the  residuals. 
If  the  model  is  accepted  then  forecasting  is  done.  If  the  model  is  un- 
acceptable then  Table  1  is  again  used  to  model  the  residual  series.  The 
model  for  residual  series  is  then  added  as  a  factor  to  the  previous  model 

a 

and  the  process  continues  until  a  model  passes  the  diagnostic  checks. 
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Table  1 


A  Description  of  Procedures  Employed 
to  Identify  Models  by  USA 


Model (s)  q 
Identified9 

Associated  Correlation  Structure  Required 
for  Identification 

Regular  AR 
of  order  one 

A.  -  1      P,  -  1      APTRNID  «  1 

P^  -  0       PPTRNID  -  0 

Regular  AR 
of  order  two 

A.  -  1                  APTRNID  ■  1 

A^  -  1                  PPTRNID  »  0      PCNECIB  >  0 

Regular  MA 
of  order  one 

A.,  »  1       P,  -  1       APTRNID  -  0 
A^  -  0                  PPTRNID  -  1 

Regular  MA 
of  order  two 

P,  -  1       APTRNID  -  0      ACNECID  >  0 
p£  -  1       PPTRNID  -  1 

Mixed  AR-MA 
first  order 

A     -  1       P.  -  1       APTRNID  -  1 

PPTRNID  «  1  - 

Seasonal  AR 
of  order  one 

A,  -  1       P.  •*  1 

4  -  1          P8  -  ° 

Seasonal  MA 
of  order  one 

4-°      ps  - x 

Try  Both 
(first  order) 
AR  and  MA 

[^  -  1  and  A^   -  0]   or   [P  -  I  and  ?z   -  01 

Try  Both 
(second  order) 
AR  and  MA 

[ACNECID  =1]   or   [PCNECID  «  1] 

Try  Both 
(first  order) 
AR  and  MA 
Seasonal 

A-  -  0       P  -  0 

A4  -  1          p;  -  1 

Try  Both  AR 
and  MA  with 
parameters  at 
significant 

lag3 

This  identification  is  U3ed  if  none  of  the  above 
identifications  are  made. 
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3.0  An  Empirical  Test  of  USA 
A  sample  of  EPS  (60  quarters)  was  taken  for  30  firms.    For  each 
firm  50  quarters  were  used  for  modeling  (using  both  USA  and  conventional 
methods)  and  the  remaining  10  quarters  were  used  for  comparing  the  USA 
and  conventional  modeling  methods. 

For  each  of  the  10  periods  on  the  forecast  horizon,  a  computation 

was  made  as  to  the  number  of  firms  for  which  USA  came  closer  in  absolute 
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value  to  the  actual  EPS  than  for  the  conventional  method.    (This  computed 

number  will  henceforth  be  referred  to  as  D.,  k  ■  1,  10.) 

Under  the  null  hypothesis  of  no  difference  between  USA  and  the  manual 

method,  we  would  expect  each  D,  to  be  equal  to  15  (i.e.,  1/2  of  the  30 

13 
firms).    Table  2  presents  a  test  of  this  hypothesis. 


Table  2 

A  Comparison  of  USA  vs.  the 
Conventional  Method  of  ARIMA  Modeling 


Period 

on  the  Forecast  Horizon 

1 

2 

3 

4 

5  16  17     8 

1 

9   !  10 

Actual 
Frequency 

12 

13 

15 

17 

18 

19 

15" 

16 

■jii  .... 
13    16 

Expected 
Frequency 

15 

1  U 

15 

15 

1  15 

15 

15 

13 

13.5 

14 

Chi 

Square  (9  df) 

3.66 

Note  that  the  test  statistic  of  3.66  is  substantially  less  than  re- 
quired 14.68  needed  to  reject  the  null  (assuming  a  ■»  .1).  The  implica- 
tion is  that  there  is  no  difference  between  the  USA  forecasts  and  those 
generated  by  conventional  modeling  procedures. 
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4.0  Summary  and  Conclusions 

The  study  presented  a  univariate  stochastic  modeling  algorithm  (USA) 
for  purposes  of  the  Bos-Jenkins  modeling  of  one  variable  time  series  via 
a  fully  automatic  process.  The  algorithm  was  programmed  for  the  computer 
and  tested  empirically .  It  was  found  that  USA  forecasts  were  not  statis- 
tically different  than  those  generated  by  conventional  modeling  procedures. 

The  results  indicate  that  there  is  evidence  that  the  Box-Jenkins 
modeling  process  can  be  fully  automated.  It  is  felt  that  the  use  of  USA 
can  (1)  increase  the  reproducibility  of  research,  (2)  save  time,  (3)  be 
used  as  a  **black  box"  by  the  statistically  untrained,  and  (4)  make  ex- 
plicit the  assumptions  employed  in  the  modeling  process. 


M/B/96 
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Notes 

Some  examples  of  the  use  of  Box- Jenkins  models  are:  Albrecht, 
Lookabill  (1977),  Brown  and  Rozeff  (1977),  Dopuch  and  Watts  (1972), 
Foster  (1977),  Lorek,  McDonald  and  Patz  (1976),  and  Watts  and  Leftwich 
(1977). 

2 

For  a  complete  discussion  see  Dharan  (1977). 

3 

For  purposes  of  the  present  study,  a  value  of  3  was  assumed  for  K. 

4 
For  purposes  of  the  present  study  a  value  of  1  was  assumed  for  L. 

For  purposes  of  the  present  study,  a  value  of  3  was  assumed  for  ra. 

For  purposes  of  the  present  study,  a  value  of  3  was  assumed  for  n. 

For  purposes  of  the  present  study,  a  value  of  1  was  assumed  for  P. 

8 
The  reason  that  the  model  identified  is  added  as  a  factor  can  be 

stated  by  the  following: 

Assume  the  following  ARIMA  model  for  Z 

(1)  <j>(B)  Z  ■  6(B)  a  where  a  is  the  nonrandom  residual 
series  which  can  be  modeled  by 

(2)  <f>»(B)  at  -  9'(B)  a*  where  a£  is  white  noise. 

(3)  Then  from  (2),  afc  =  ^(B)*  8'(B)  a'  and  substituting 
the  right  hand  side  of  (3)  into  (1)  we  get 

(4)  $(B)  Zt  =  6(B)  ^(B)"1  e'(B)  a» 
or 

(5)  <t>'(B)  6(B)  Zt  =  9'(B)  6(B)  a^ 
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Note  that  (5)  is  the  same  as  (1)  but  the  factors  of  the  noise  model  have 
been  added.  Typically  we  would  find  $(B)  to  be  a  regular  autoregressive 
factor,  4»'(B)  to  be  a  seasonal  autoregressive  factor,  8(B)  to  be  a  regular 
moving  average  factor  and  9'(B)  to  be  a  seasonal  moving  average  factor. 
For  a  discussion  of  multif actor  models  see  Box  and  Jenkins  (197Q,  Chapter  9) 

9 
In  those  cases  where  more  than  one  model  is  tried,  USA  will  select 

the  one  with  the  smaller  mean  square  error  (where  the  denominator  is  ad- 
justed for  the  appropriate  number  of  degrees  of  freedom) . 

See  Appendix  1  for  a  list  of  sample  firms.  Also  EPS  was  taken 
from  Moody's  Handbook  and  adjusted  for  changes  in  capital  structure.  In 
addition  for  firms  1,  2,  3,  5,  7,  12,  14,  15,  16,  18,  20,  21,  22,  24,  25, 
and  29,  EPS  were  computed  using  information  from  schedule  B-3  of  the  Civil 
Aeronautics  Board  (CAB)  form  41  in  conjunction  with  the  CAB  quarterly 
periodical  Air  Carrier  Financial  Statistics. 

The  models  are  not  presented  for  sake  of  space,  but  will  be  fur- 
nished upon  written  request  to  the  author. 

12 

See  Appendix  2  for  a  description  of  the  periods  used  for  modeling 

and  forecasting. 

13 

The  expected  frequencies  for  periods  7-10  have  been  adjusted 

slightly  due  to  a  small  amount  of  missing  data. 
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APPENDIX  1 
LIST  OF  SAMPLE  FIFMS 


1.  Airlift  International 

2.  Alaska  Airlines 

3.  Aloha  Airlines 

4.  American  Airlines 

5.  Aspen  Airways 

6.  Braniff  Airways 

7.  Caribbean  Atlantic  Airlines 

8.  Continental  Airlines 

9.  Delta  Airlines 

10.  Eastern  Airlines 

11.  Tiger  International  Airlines 

12.  Frontier  Airlines 

13.  Hawaiian  Airlines 

14.  National  Airlines 

15.  New  York  Airways 

16.  North  Central  Airlines 

17.  North  West  Airlines 

18.  Ozark  Airlines 

19.  Pan  American  Airways 

20.  Piedmont  Airlines 

21.  Reeve  Airlines 

22.  SFO  Airlines 

23.  Seaboard  World  Airlines 

24.  Southern  Airways 

25.  Texas  International  Airlines 

26.  Trans  World  Airlines 

27.  UAL  (United  Airlines) 

28.  Western  Airlines 

29.  Wien  Airlines 

30.  Allegheney  Airlines 


Each  firm  will  be  subsequently  referred  to  by  the  identifying  number 
that  precedes  it. 


APPENDIX  2 


DESCRIPTION  OF  AVAILABLE  DATA  FOR 
FORECAST  ERROR  ANALYSIS 


This  appendix  gives  a  firm  by  firm  description  of  tha  number  of 
quarters  of  data  available  for  forecast  error  analysis.  For  each  firm 
the  number  of  periods  in  the  base  period,  the  origin  date  for  forecasting, 
and  the  number  of  absolute  forecast  errors  is  presented. 


firm 

number  of  periods 

number 

in  base  period 

1 

50 

2 

50 

3 

50 

4 

50 

5 

30 

6 

30 

7 

40 

8 

50 

9 

50 

10 

50 

11 

50 

12 

50 

13 

50 

14 

50 

15 

50 

16 

50 

17 

50 

18 

50 

19 

50 

20 

50 

21 

50 

22 

42 

23 

50 

24 

50 

25 

50 

26 

50 

27 

50 

28 

50 

29 

30 

30 

50 

origin  date  for 
forecasting 

2/74 
2/74 
2/74 
2/74 
2/74 
3/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/74 
2/76 
2/74 


number  of  steps  ahead 
forecast  error  was  computed 

10 
10 
10 
10 
10 

8 

9 

7 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 

7 
10 


For  example  in  the  case  of  firm  1,  50  quarters  of  data  were  used  in 
transfer  and  univariate  estimation,  and  actual  and  predicted  forecasts 
were  computed  over  a  10  period  forecast  horizon  with  the  first  forecast 
being  for  the  third  quarter  of  1974. 


U  ""s"  OF  ILLINOIS-unBANA 


3  0112  060296438 


